Phase transition and landscape statistics of the number partitioning problem 
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The phase transition in the number partitioning problem (NPP), i.e., the transition from a region 
in the space of control parameters in which almost all instances have many solutions to a region in 
which almost all instances have no solution, is investigated by examining the energy landscape of 
this classic optimization problem. This is achieved by coding the information about the minimum 
energy paths connecting pairs of minima into a tree structure, termed a barrier tree, the leaves and 
internal nodes of which represent, respectively, the minima and the lowest energy saddles connecting 
those minima. Here we apply several measures of shape (balance and symmetry) as well as of branch 
lengths (barrier heights) to the barrier trees that result from the landscape of the NPP, aiming at 
identifying traces of the easy/hard transition. We find that it is not possible to tell the easy regime 
from the hard one by visual inspection of the trees or by measuring the barrier heights. Only the 
difficulty measure, given by the maximum value of the ratio between the barrier height and the 
energy surplus of local minima, succeeded in detecting traces of the phase transition in the tree. In 
adddition, we show that the barrier trees associated with the NPP are very similar to random trees, 
contrasting dramatically with trees associated with the p spin-glass and random energy models. We 
also examine critically a recent conjecture on the equivalence between the NPP and a truncated 
random energy model. 



I. INTRODUCTION 

The relevance of the concepts and techniques of sta- 
tistical physics to understanding the typical behavior of 
classes of optimization or decision problems had been 
pointed out by many authors already in the middle of 
the 1980s (see, e.g., Refs. 0,0)- However, it was only 
about ten years later, owing mainly to the finding of a 
ubiquitous peak in computational cost signaling a transi- 
tion between easy and difficult instances of optimization 
problems, that the physics approach has succeeded to 
attract the attention of the computer science community 
(see, e.g., Refs. 0B1&E10)- In particular, instances 
in the phase transition region are now routinely used to 
benchmark algorithms and search heuristics, and so a 
precise location of the critical point in addition to the 
estimate of the width of the transition region has gained 
considerable practical importance. 

The specific optimization problem we consider here is 
the number partitioning problem (NPP), which is one 
of the basic NP-complete problems that form the core 
of the theory of NP-completeness 8] . NPP has an easy 
formulation: Given N not necessarily distinct positive 
numbers a%, . . . , a^v find a subset X C {1, . . . , N} such 
that 



a Mattis-like Ising spin model with Hamiltonian 

H(X) = E 2 (X) = 'i.'ijT.rr, (2) 

hi 

where <x; = +1 if i G X and <7, = — 1 if i £ X 0. It 
is therefore natural to consider E as an energy (cost) 
landscape over the hypercube; in other words, single spin 
flips are a natural way of defining a neighborhood relation 
for the NPP. For concreteness, we will assume from here 
on that the OjS are independent, identically distributed 
random variables that take on integer values between 1 
and I with equal probability. 

A partition 9fi is perfect if E$S) = or 1 for J^. a, even 
or odd, respectively. The existence of perfect partitions 
depends on the accuracy to which the numbers a, are 
determined as well as on the size of the problem N. The 
crucial control parameter here is the ratio between the 
number of bits to which aj is specified and the problem 



N ' 



(3) 



E(X) 



(1) 



is minimized. We remark that NPP can be regarded as 



The relevance of k can be appreciated by considering the 
annealed estimate of the expected number of perf ect par- 
titions, S = 2 N /l, so that k = 1- (log 2 S)/N\U^. Hence 
the annealed theory indicates that for k < 1 there is an 
asymptotically exponential number of perfect partitions, 
while for k > 1 the probability of finding a perfect parti- 
tion is exponentially small. Extensive numerical simula- 
tions and statistical mechanics calculations corroborate 
the value k c — 1 as the threshold separating the easy-to- 
solve from the hard-to-solve regimes [lfl Hl| . 
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Probabilistic and statistical mechanics analyses of the 
ground states of the Hamiltonian J2J in the limit of infi- 
nite precision I — > oo, in which the a^s can be viewed as 
continuous variables, have of course failed to detect the 
phase transition (see, e.g., Refs. |9lll2|>ll3|)- The thermo- 
dynamics for general I was 'solved' under the assumption 
of self-averaging of the random variables a^, but that so- 
lution is not completely satisfactory since it predicts a 
negative entropy for n > k c 11]. Thus a reliable theory 
for the ground states of the NPP, probably based on the 
replica method, has yet to be obtained. 



Generally, the aforementioned phase transition is de- 
fined as a transition from a region in the space of control 
parameters in which almost all instances have many so- 
lutions to a region in which almost all instances have 
no solution 0, 3- Hence, the investigations have relied 
mainly on exhaustive search procedures, such as branch- 
and-bound algorithms, that guarantee the finding of the 
global optima, or on statistical mechanics calculations 
of the expected properties, such as the entropy, of the 
ground states. In this contribution we seek evidence of 
this easy /hard transition in the structure of the cost land- 
scape of the optimization problem, focusing on the distri- 
bution of optima and on the distribution of cost barriers 
between these optima. To this end, we code the informa- 
tion about the paths of minimal cost leading to different 
optima in a tree structure, termed the barrier tree of the 
cost landscape. The leaves of this tree represent the lo- 
cal optima and the internal nodes the lowest-cost saddles 
connecting those optima. Barrier trees have been widely 
used to study proteinEl E EU, RNA [H E3, and 
spin-glass 0, |2(], 0, l22| landscapes. 



We find that the structure of the landscape, as mea- 
sured by the local minima and their connecting saddle 
points, shows surprisingly little difference in the easy and 
hard regimes. The sharp transition between these two 
regimes is revealed only by the difficulty of the landscape, 
a parameter measuring the maximum ratio of energy bar- 
rier to energy gain for the escape from a metastable state, 
which is directly related to the optimal speed of conver- 
gence of simulated annealing [2J, IH |2l| H| |27|. We 
stress that our goal here is not to locate the transition 
point, which can be achieved by simply looking at the 
value of the global energy minimum in an ensemble of 
randomly generated instances (in this sense, a barrier tree 
always contains the information on whether the given in- 
stance is easy or hard) , but to seek evidences of the easy 
and hard regime on other global statistical properties of 
the energy landscape. Furthermore, we examine a re- 
markable conjecture about the equivalence between the 
infinite accuracy version of the NPP and the random cost 
problem [2^, and show that, despite the equivalence at 
the level of the energy distributions, their barrier trees 
are completely different. 




(a) (b) 

FIG. 1: Typical barrier trees for (a) the number partitioning 
problem with infinite precision numbers and (b) the truncated 
random energy model (random cost problem) for N — 10. 

II. BARRIER TREES 

The energy of the lowest saddle point separating two 
local minima x and y is 

E[x,y] = min max£(z) (4) 
pePx H z€p 

where P xy is the set of all paths p connecting x and y by a 
series of subsequent spin-flips 19, 29] . The barrier height 
B(x) of a metastable state x is the minimum height of 
a saddle point that connects it with metastable state y 
with strictly smaller energy E(y) < E(x). In symbols, 

B(x) = min {E\x, y] - E{x)\ y : E(y) < E(x)} . (5) 

Since a direct evaluation of eq. (0J) would require the ex- 
plicit constructions of all possible paths it does not pro- 
vide a feasible algorithm for determining E[x,y] even if 
N is small enough to allow an exhaustive survey of the 
landscape. The values of E[x,y] and B(x) can, however, 
be retrieved from the barrier tree of the landscape. The 
al gori thm for constructing these barrier trees is presented 
in [It], [Ti| (see also Ref. [2(| for a detailed account of the 
algorithm in the spin-glass context). It is implemented 
in the barriers program [3^, which constructs the tree 
from a sorted list of energy values of all spin configura- 
tions in the landscape. In a barrier tree, the leaves of the 
tree represent the local minima, and the internal nodes 
represent the saddles, with the barrier sizes given by the 
length of the branches connecting the local minima to 
their corresponding saddles. Fig. ^ illustrates typical 
barrier trees for the problems considered in this paper. 

III. DEPTH AND DIFFICULTY 

The depth D and difficulty ip of a landscape are mea- 
sures of the landscape structure that are directly related 
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to the performance of simulated annealing |23t 124. l25l 
l2rj |27| . The depth of a landscape is defined as the maxi- 
mum barrier height, D = maxB(x), where the maximum 
is taken over non-global minima only. In particular, it 
can be shown that simulated annealing converges almost 
surely to a ground state if and only if the cooling schedule 
T fe satisfies Y,k>o exp(-D/T k ) = oo 23]. The difficulty 
of the landscape is a dimensionless quantity defined as 



B(x) 



E(x) — _E(min) 



(6) 



where iS(min) is the global energy minimum and, as be- 
fore, the maximum is taken over local minima only. The 
difficulty ip is directly related to the optimal speed of con- 
vergence of simulated annealing. It is more convenient to 
work with the scaled quantity 



A = log 2 (i,/2 N ) = log 2 ^ - N 



(7) 



instead. 

We turn now to the evaluation of the effects on the 
depth and difficulty measures of a change in the accuracy 
of the a^s for a fixed problem size. We find that, similarly 
to many other tree measures discussed in the next sec- 
tion, the depth measure D is independent of the accuracy 
of the djS. The effects on the difficulty measure, on the 
other hand, are striking. Explicitly, in Fig. [21 where each 
symbol represents the result of the average over 100 land- 
scapes, we show that there is a scaling relation between 
the average difficulty and the accuracy of number repre- 
sentation: (A) converges to a unique function of (k— 1)N 
for large N. This scaling function increases linearly for 
k < I and approaches a constant value of about —2.0 for 
k > 1. Hence (A) viewed as a function of k — I exhibits a 
singularity at k = 1 since it increases linearly with a slope 
proportional to N as long as k < 1 and tends towards 
a constant value for k > 1. Therefore the rescaled dif- 
ficulty reflects the phase transition reported in previous 
analyses of the NPP, which have focused on the sing ular 
behavior of the probability of a perfect partition [Tij, [TJJ • 

A simple annealed-like argument to explain the behav- 
ior of the difficulty depicted in Fig. [21 goes as follows. In 
the easy regime we have more partitions (spin configura- 
tions) than different combinations of numbers, thus the 
global optimum will probably be a perfect partition, i.e., 
-Emm = or I, while the lowest metastable state will have 
E = 2 or 3. The height of the barrier separating them, 
however, is essentially the energy of a random configura- 
tion, i.e., B(x) = 0(1), and the maximum barrier height 
will be among the largest numbers in the system, i.e., 
log 2 ip ~ log 2 I. Hence subtracting TV from both sides 
yields A rs (k — 1)N in the easy regime. In the hard 
regime we can only hope to cancel the leading N bits in 
the optimal configuration, thus we expect a ground-state 
energy E m i n ~ l/2 N , while the maximum barrier height 
is again 0(1) yielding log 2 tp w N, which is then indepen- 
dent of k. Of course, these crude estimates miss polyno- 
mial corrections such as the factor N 1 / 2 that appears in 
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FIG. 2: (a) Data collapse of the rescaled average logarithmic 
difficulty (A) as function of the rescaled accuracy (k — 1)N for 
problem sizes N = 12, 1 4, 16, 18 and 20. (b) Detailed data for 
(ft — 1)N = 5 (•), (■), and —5 ((}) confirm the existence of 
the scaling function for large N. 



the rigorous computation of the ground-state energy |12| . 
Note, however, that we are considering log 2 .Emm, i.e., we 
ignore only logarithmic corrections which we would ex- 
pect to arise for more careful estimates of log 2 S(a;) as 
well. As the values of Imp vary significantly among dif- 
ferent landscapes with the same values of I and N it is 
not possible to obtain sufficiently accurate estimates of 
(A) that would reveal such corrections unambiguously. 



IV. MEASURES OF BARRIER TREE SHAPES 

Since a barrier tree embodies all the relevant quanti- 
tative information about the multi- valley structure of an 
energy landscape, it seems natural to ask if there is any 
trace of the easy/hard phase transition in the shape of 
the barrier trees of the NPP landscapes. 

Before introducing the standard measures of tree 
shapes we will consider the effect of the accuracy I on 
probably the main characteristic of a tree, namely, its 
number of leaves or minima n. In fact, it is tempting 
to associate the difficulty of a problem with the number 
of local minima (traps) in its energy landscape represen- 
tation. Surprisingly, however, we find that the average 
fraction of local minima does not change with I as soon 
as I > N, i.e., as soon as it becomes unlikely that ad- 
jacent configurations have the same energy, see Fig. [3 
For very small values of / the number of local optima de- 
pends strongly on how degenerate neighbors are treated 
(data not shown). The important point here is that the 
fraction of minima stays constant across the easy /hard 
transition and, in particular, it is given by the formula 



(8) 



which was obtained in the limit of infinite accuracy [13| . 

Now we consider five measures of tree shape that were 
originally used to study phylogenetic trees (see, e.g., Refs. 
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FIG. 4: None of the measures of the tree-shape shown here 
reflect the easy/hard transition of the NPP (a) imblance C, 
(b) inverse subtree height B%, (c) weighted leaf- height E>2- 
The data for N = 12(T), 14(A), 16(f), 18(B), and 20(«) are 
averages over 100 landscapes. 



FIG. 3: Average fraction of local minima (n)/2 JV as a func- 
tion of the rescaled accuracy (k — 1)N for (top to bottom) 
N = 12, 14, 16, 18 and 20. Each symbol is the result of an 
average over 200 landscapes and the horizontal lines are the 
theoretical predictions for the limit I — > oo. 



[30l l3lT|) for the barrier trees resulting from NPP land- 
scapes. These measures provide statistical information 
about the shape of the barrier tree, mainly its symme- 
try or balance, and ignore the branch lengths, i.e., the 
height of the barriers between minima, which were the 
object of the depth and difficult measures. Recently, we 
showed that these measures are capable of distinguishing 
between p-spin models with different values of p |22| . 

Let d(i,j) be the graph-theoretical distance between 
two nodes of the tree, i.e., the number of edges along 
the path that connects them. Furthermore, we denote 
the root of the tree by 0. The height of a leaf k is 
hk = d(0,k). Equivalcntly, hk is the number of inter- 
nal nodes between leaf k and the root (inclusive). For 
each interior node i we have two subtrees with r, and 
Si leaves, respectively. We assume fj > Sj. The subtree- 
height of an interior node i is m, = maxfcg^ d(i, k) where 
the maximum is taken over all leaves k in the subtree Tj 
below i, i.e., the subtree of which i is the root. 

With this notation we may define the following five 
characteristic values for the shape of a binary rooted tree: 

1. H = — X)fc=i ^fe i s the average height of a leaf in 
the tree. 



2. uh — y \ J2k=i (^ fe ~~ Hy ', is the standard devia- 
tion of the leaf height. 



3. C 



n(»-3)+2 S"=i ( r * ~ s ») is a measure for the 
imbalance of trees with n > 2. 

4. Bi = Xi,:/0 V m, i i s the average inverse subtree 
height, where the sum is taken over all n — 2 in- 
ternal nodes i excluding the root 0. 

5. B2 = Sfe=i ^~ hk hk is an alternatively weighted av- 
erage leaf-height. 



The physical meaning of H and <jh is clear. We men- 
tion only that for random trees (e.g ., trees produced by 
the neutral genealogical process 32] ) the expected value 
of H increases as Inn. In addition, an = for a com- 
pletely symmetric tree. The imbalance measure C assigns 
a weight proportional to the the number of leaves to each 
one of the two subtrees branching out an internal node. 
These weight differences are then averaged and normal- 
ized over all internal nodes of the tree. The value of C 
increases from for a completely symmetric tree to 1 for a 
completely asymmetric tree. The statistic B\ looks at the 
longest possible path rrn from each internal node i to any 
of the leaves in its subtree. The statistic B2 is based on 
an index of information content. For highly asymmetric 
trees, such as those produced by p-spin landscapes [22| . 
it will quickly converge to the value B2 = 2. Both B\ 
and B2 have smaller values for increasingly asymmetric 
trees. 

Somewhat surprisingly, neither of these measures ex- 
hibits a non-trivial dependence on k, as shown in Fig. 01 
for the measures C, B\ and i?2- The other quantities H 
and oh behave analogously but have a larger scatter for 
large N. Therefore it is impossible to tell the easy from 
the hard regime by visual inspection of the barrier tree or 
by simply measuring branch lengths, as done in the depth 
measure. The only effective measure involves a nontrivial 
balance between branch lengths and leaf energies. 



V. TRUNCATED RANDOM ENERGY MODEL 

A very interesting though poorly explored finding con- 
cerns the equivalence between the NPP in the limit of 
infinite precision, where the at become real random vari- 
ables distributed uniformly in the unit interval, and the 
symmetrized truncated random energy model (REM) or 
random cost problem [2^]. In particular, for large prob- 
lem sizes the energies of two or more distinct configura- 
tions of the NPP become statistically independent and 
the M = 2 N ~ X distinct energies values for unconstrained 
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partitions are distributed according to 

^> = db cxp (-^S) e(£) (9) 

where /loo = (a 2 ) = 1/3 is the second moment of the ay in 
the corresponding number partitioning problem. Hence 
the bold claim that the NPP is essentially equivalent to a 
truncated REM . The main application of Eq. @ is 
the derivation of the probability density of the minimum 
energy E m { n using trite arguments of extreme statistics 

m, 

p (E min ) = MP (0) exp [-MP (0) E min ] , (10) 

from which the expected minimum energy follows triv- 
ially, 

(E min ) = N- 1 ' 2 2- N (11) 

in a gree ment with the known numerical |13| and analyt- 
ical [HI results for the NPP. 

However, in order for the equivalence at the level of the 
energy distribution between the NPP and the truncated 
REM to have any use in guiding the design of search 
heuristics for the NPP, it is important that other features 
of the two problems, such as their multi- valley structures, 
are similar too. In fact, a glance at Fig. ^ is already 
sufficient to reveal the deep structural difference between 
the barrier trees of these problems, and the remainder of 
this section is aimed at quantifying these differences. 

The expected number of minima can be easily cal- 
culated for any random energy model with finite prob- 
ability density over the reals 13- The argument 
goes as follows. Fix an arbitrary spin configuration 
a = ((7i, o"2, . . . , <7zv) and consider all its N neighbors. 
Since we assign to each spin configuration a random en- 
ergy value drawn from the continuous distribution 
we conclude that (i) all these N + 1 energies are distinct 
with probability 1; and (ii) E(a) is the smallest of the 
N+l numbers with a probability of l/(N +1). Hence a 
is a local minimum with probability 1/(N + 1) and so the 
fraction of local minima is 

(n)/2 N = 1/(N+1) (12) 

which, for large N, is larger than the result for the NPP 
[see Eq.©] by a factor of order of TV 1 / 2 . 

In Fig. El we present the tree size (n) dependence of 
the barrier tree measures C and H for the NPP and the 
symmetrized truncated REM. A useful standard here, 
also shown in this figure, is the random trees, generated 
as follows. First, create n nodes (the leaves) and put 
them in a set A. Next, remove two random nodes x and 
y from A, create a new node z and make x and y its two 
children, and put z in the set A. Repeat this procedure 
until there is only one node left in A, which will be the 
root of the tree. Random trees are important from the 
biological viewpoint because they arise from the neutral 
genealogical process j3^|. The symmetrized truncated 
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FIG. 5: Tree balance measures C and H as functions of the 
tree size n for the symmetrized truncated REM (a), the NPP 
(•) and random trees (<(>). The solid lines are numerical fit- 
tings: H = 0.4n and C = 0.6 + 0.1 In In (n) for the REM, and 
H = -1 + 21n(n) and C = 5/n for the NPP and random 
trees. 

REM presents the same scaling on n as the p-spin mod- 
els, namely, H ~ n and C ~ In In (n) within the range of 
n considered Of course, since C6 [0, 1] this scaling 
cannot be valid for n — > oo, but the double logarith- 
mic dependence guarantees its validity for a very large 
range of tree sizes. Actually, as far as the five statistics 
introduced in Sec. IIVI are concerned, there is no signi- 
ficative differences between the symmetrized truncated 
REM and the standard REM, both models producing 
then extremely unbalanced trees. These measures, how- 
ever, differ dramatically between the NPP and all pre- 
vious spin-glass models analysed Surprisingly, the 
NPP barrier trees are practically indistinguishable from 
random trees and, in particular, the tree measures obey 
the same scaling relation with the tree size, H ~ In (n) 
and C ~ 1/n. Hence these trees, in stark contrast with 
the p-spin model and REM barrier trees, become more 
and more balanced as n increases. 

Since the equivalence between NPP and the truncated 
REM discussed above was proved only in the case of in- 
finite precision numbers (i.e., k — > oo) it provides no 
clues about the easy/hard phase transition that takes 
place at k = 1. Nevertheless, it is natural to ask if the 
integer counterpart of the truncated REM, obtained by 
considering the integer-valued energies E' = [E\ where 
E is drawn from the distribution @ with fi^ replaced 
by [ii = 1(1 + 1) (2Z + l)/6, exhibits a phase transition. 
This can easily be answered by calculating the probabil- 
ity that a partition is perfect (i.e., E' min = or 1) which 
is given by 

^perf = / d E mm P (E m in) = 1 - ffltp (-2£) (13) 

Jo 

where £ = 2 N / (2tt[hN) 1/2 . Although one clearly recov- 
ers the easy (-P per f ~ 1) and the hard (-P per f ~ 0) regimes 
depending on whether the ratio (log 2 l)/N is very small 
or very large, respectively, there is definitely no phase 
transition separating them. 
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VI. CONCLUSION 

Phase transitions in physical systems are characterized 
by the appearance of singularities in some observables, 
known as the order parameters of the system, such as, 
e.g., the gas density in the boiling transition. In the case 
of mean-field spin-glass models the order parameter di- 
rectly reflects the hierarchical organization of pure states 
in a complex multi- valley structure . Therefore one ex- 
pects that some features of that structure must undergo 
abrupt changes when the critical point is approached. 
Unfortunately, the vast majority of the phase transitions 
take place at finite temperature, while a direct study 
of the landscape properties of spin-glass models based 
on natural quantities, such as saddle points and min- 
ima, is feasible at zero temperature only. In that sense, 
the easy/hard phase transitions in optimization problems 
in general, and in the number partitioning problem 
(NPP) [13, HH considered here, provide a unique chance 
to study how the onset of the phase transition affects the 
organization of the metastable states of a disordered spin 
system. 

Somewhat surprisingly, we find that almost all fea- 
tures of the landscape, which we have properly mapped 
into a tree structure through the barriers program, are 
insensitive to the onset of the easy/hard phase transi- 
tion that takes place when the number of bits needed 
to specify a number a,i equals the problem size, i.e., 
k = (log 2 l)/N = 1. Interestingly, only one of the 
measures studied, premonitorily termed difficulty in the 
mathematical literature of simulated annealing j2J], ex- 
hibits a singular behavior at the critical point. As a re- 
sult, the quality of the optima found by simulated anneal- 



ing will probably depend strongly on whether the control 
parameters set the instance in the easy {k < 1) or hard 
(k > 1) regime. 

An important by-product of our study of the NPP 
landscape is the finding that the resulting barrier trees 
are very similar to random trees, and so they become 
completely balanced (symmetric) in the limit of large 
system sizes N or, equivalently, large tree sizes n. These 
trees contrast drastically with the barriers trees result- 
ing from the p spin-glass, the random energy or the sym- 
metrized truncated random energy landscapes, which be- 
come completely unbalanced in that limit. In this con- 
text, we note that although there is an equivalence at the 
level of the energy distribution between the NPP and the 
truncated random energy model, the statistical proper- 
ties of their energy landscapes are very different, and 
probably so are the performances of local search heuris- 
tics in finding near-global solutions to these problems. 
Actually, the similarity of the NPP barrier trees with 
random trees may be part of the explanation for the fail- 
ure of local search techniques to produce good solutions 
to this optimization problem. 

Acknowledgments 

Thanks to Christoph Flamm at the University of Vienna, 
Austria, for his help with the barriers program. This re- 
search was supported by Fundagao de Amparo a Pesquisa do 
Estado de Sao Paulo (FAPESP), project 99/09644-9. The 
work of J.F.F. is supported in part by CNPq and WH is sup- 
ported by FAPESP. P.F.S. gratefully acknowledges the hospi- 
tality of the Instituto de Fi'sica de Sao Carlos in Nov. 2002. 



[1] P. W. Anderson, Physica A 140, 405 (1986). 

[2] M. Mezard, G. Parisi and M. A. Virasoro, Spin Glass 

Theory and Beyond (World Scientific, Singapore, 1987). 
[3] S. Kirkpatrick and B. Selman, Science 264, 1297 (1994). 
[4] T. Hogg, B. A. Huberman and C. P. Williams, Artif. 

Intell. 81, 1 (1996). 
[5] B. Hayes, Am. Sci. 85, 108 (1997). 

[6] R. Monasson, R. Zecchina, S. Kirkpatrick, B. Selman and 

L. Troyansky, Nature 400, 133 (1999). 
[7] O. Martin, R. Monasson and R. Zecchina, Theor. Comp. 

Sci. 265, 3 (2001). 
[8] M. R. Garey and D. S. Johnson, Computers and In- 
tractability: A Guide to the Theory of NP-Completeness 
(Freeman, New York, 1979). 
[9] Y. Fu, in Lectures in the Science of Complexity edited by 
D. L. Stein (Addison- Wesley, Reading, 1989). 
[10] I. P. Gent and T. Walsh, Comput. Intell. 14, 430 (1998). 
[11] S. Mertens, Phys. Rev. Lett. 81, 4281 (1998). 
[12] N. Karmarkar, R. M. Karp, G. S. Lueker, and A. M. 

Odlyzko, J. Appl. Probab. 23, 626 (1986). 
[13] F. F. Ferreira and J. F. Fontanari, J. Phys. A 31, 3417 

(1998); Physica A 269, 58 (1999). 
[14] O. M. Becker and M. Karplus, J. Chem. Phys. 106, 1495 



(1997). 

[15] D. J. Wales, M. A. Miller and T. R. Walsh, Nature 394, 
758 (1998). 

[16] P. Garstecki, T. X. Hoang and M. Cieplak, Phys. Rev. E 

60, 3219 (1999). 
[17] C. Flamm, W. Fontana, I. L. Hofacker and P. Schuster, 

RNA 6, 325 (2000). 
[18] C. Flamm, I. L. Hofacker, P. F. Stadler and M. T. Wolfin- 

ger, Z. Phys. Chem. 216, 155 (2002). 
[19] K. Nemoto, J. Phys. A 21, L287 (1988). 
[20] F. F. Ferreira, J. F. Fontanari and P. F. Stadler, J. Phys. 

A 33, 8635 (2000). 
[21] J. F. Fontanari and P. F. Stadler, J. Phys. A 35, 1509 

(2002). 

[22] W. Hordijk , J. F. Fontanari, and P. F. Stadler, e-print 

cond-mat/0302041 
[23] B. Hajek, Math. Operations Res. 13, 311 (1988). 
[24] O. Catoni, Ann. Probab. 20, 1109 (1992). 
[25] W. Kern, Discr. Appl. Math. 43, 115 (1993) 
[26] J. Ryan, Discr. Appl. Math. 56, 75 (1995). 
[27] O. Catoni, Lecture Notes in Mathematics 709, 69 (1999). 
[28] S. Mertens, Phys. Rev. Lett. 84, 1347 (2000). 
[29] A. M. Vertechi and M. A. Virasoro, J. Phys. France 50, 



7 



2325 (1989). 

[30] K-T. Shao and R. R. Sokal, Syst. Zool. 39, 266 (1990). 
[31] M. Kirkpatrick and M. Slatkin, Evolution 47, 1171 
(1993). 

[32] J. Felsenstein, Inferring Phytogenies (Sinauer Associates, 
Sunderland, 2002). 



[33] B. Derrida, Phys. Rev. B 24, 2613 (1981). 
[34] D. J. Gross and M. Mezard, Nucl. Phys. B 240, 431 
(1984). 

[35] The source code is available at 
http : / /www . tbi . univie . ac . at/~ivo/RNA/Barriers/. 



